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Abstract 

The paper reports on first preliminary results and in- 
sights gained in a project aiming at implementing the 
fluent calculus using methods and techniques based on 
binary decision diagrams. After reporting on an initial 
experiment showing promising results we discuss our 
findings concerning various techniques and heuristics 
used to speed up the reasoning process. 
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Introduction 

In recent years we have seen highly advanced and 
novel implementations of propositional calculi and sys- 
tems like, for example, GSat and its variants (Sel- 
man, Levesque, & Mitchell 1992), Smodels (Niemela, 
& Simons 1997) or DLV (Eiter et al. 1998), to men- 
tion just a few. The implementations were applied 
to many interesting fields in Intellectics like, for ex- 
ample, planning or non-monotonic reasoning. On the 
other hand, few results are reported so far on apply- 
ing another propositional method in these fields, viz., 
model checking using binary decision diagrams (BDDs) , 
with Cimatti et. al (1997; 1998; 1999) and (Edelkamp 
& Reffel 1999a) being an exception. This comes to 
a surprise because model checking using binary deci- 
sion diagrams has significantly improved the perfor- 
mance of algorithms and enabled the solution of new 
classes of problems in related areas like formal verifi- 
cation and logic synthesis (see e.g. (Burch et al. 1992; 
1994)). Can we adopt the technology developed for 
model checking of finite state machines using binary 
decision diagrams for the solution of planning problems 
and, more generally, problems occurring in reasoning 
about situations, actions and causality? Can we enrich 
these techniques by exploiting the experiences made in 
the state of the art implementations of propositional 
logic calculi and systems mentioned at the beginning of 
this paragraph? 

In order to answer these and related questions a 
sound and complete mapping from (a fragment of) 
the fluent calculus (Holldobler & Schneeberger 1990; 
Thielscher 1998a) to propositional logic is specified in 



(Holldobler & Storr 1999) such that the entailment 
problem in the fluent calculus can be solved by finding 
models for the corresponding propositional logic for- 
mula. The propositional logic formulae are represented 
by reduced and ordered binary decision diagrams and 
techniques from model checking are applied to search 
for models. Our mapping relics on three properties of 
the considered fragment of the fluent calculus: 

• The set of states is characterized by a finite set of 
propositional fluents, i.e., a set of propositional vari- 
ables, which can take values out of {T, _L} . 

• The actions are deterministic and their preconditions 
as well as effects depend only on the state they are 
executed in. 

• The goal of the planning problem is a property which 
depends solely on the reached state. 

Here we report on initial results, findings and insights 
gained with the BDD-based implementation of the flu- 
ent calculus. After briefly discussing the fluent calculus 
and the implementation using an example from the so- 
called GRiPPER-class, we concentrate on two heuristics 
and techniques which can be applied to speed up the 
solution of the planning problem. In particular, we dis- 
cuss some results on variable ordering and partitioning 
of the transition relation. 

For the convenience of the reader we give a very 
brief review of some basic notions of the fluent calcu- 
lus and the concept of BDDs, but for a detailed intro- 
duction into the matter we refer to (Thielscher 1998a; 
Holldobler & Storr 1999) and (Bryant 1986) as refer- 
ences respectively. 

Gripper Planning Problems 

In a contest held at AIPS98, planners had to solve var- 
ious problems, among which were the problems of the 
so-called Gripper class: 

A robot equipped with two grippers G\ and G2 can 
move between two rooms A and B . Initially the 
robot is in room A together with a number of balls 
Bi, . . . , B n . The task is to transport these balls 
into room B . 



We will specify Gripper class problems in the fluent 
calculus in a moment. Before doing so, however, some 
notational conventions are helpful. Words starting with 
an upper letter denote constants, whereas words start- 
ing with a lower letter denote predicate symbols, non- 
nullary function symbols and variables. Additionally 
we assume that each variable a denotes an action, s a 
situation, / a fluent and z a state (i.e. are variables 
of the corresponding sorts action, sit, fluent, state 
in the fluent calculus, see (Thiclschcr 1998a; Holldobler 
& Storr 1999)). All symbols may be indexed. 

In the fluent calculus situations are denoted by So 
standing for the initial situation, and by use of the 
function do(a,s) , denoting the situation after execu- 
tion of an action a in an situation s . States are de- 
noted by combining the fluents, which hold in the state, 
with the associative and commutative binary operation 
symbol o , effectively representing multisets of fluents. 
In the case of propositional fluents, as considered in 
this paper, the states contain each fluent at most once. 
represents the empty state. Thus, o fulfills the prop- 
erties: 1 

(z 1 oz 2 )oz 3 = z 1 o(z 2 oz 3 ) 

z\o z 2 — z 2 o z\ (AC1) 
z o = z 

A normal form to write ground state terms are the so- 
called constructor state terms of the form 0o/ 1 o. . .o/„ , 
n > where the /j's are pairwise distinct. 

state(s) denotes the state holding in a situation s . 
We also make frequently use of the abbreviation 

holds (f, s) = (3z) state(s) = f o z . 

The initial state of a reasoning problem in the fluent 
calculus is specified by an axiom of the form 

T Sa = {state(S ) = t}, (1) 

relating the initial situation So to a state t represented 
as an constructor state term. If an equation like (1) is 
given, then &i(z) denotes the equation z = t. Turning 
to the example, the initial state of a Gripper class 
problem is specified by 

T Sa = {state(S ) = %oat(B u A)o ... o at(B n ,A) 
o free(Gi) o free(G 2 ) o at-robby(A)} , 

where n is instantiated to some number. 

The fluent at(b, r) states that ball b is at room r , 
free(g) states that gripper g is free and at-robby(r) 
states that the robot is at room r . 2 

There are three actions in the Gripper class: 

• the robot may move from one room to the other. 

• the robot may pick up a ball if it is in the same room 
as the ball and one of its grippers is empty. 

1 Frce variables in formulae are assumed universally quan- 
tified, unless otherwise stated. 

2 Formally, b , r , g denote variables of new sorts ball , 
ROOM and GRIPPER . 



• the robot may drop a ball if it is carrying one. 

These actions are specified by means of state update 
axioms, which relate a state state(s) and the state 
state(do(a, s)) after executing an action a. The gen- 
eral form for state update axioms is as follows: 

A(s) — > state(do(a, s)) o <3~ = state (s) o i9 + , 

where z9 + are positive Effects of a , i.e. the fluents 
which did not hold before and will hold after executing 
the action, -&~ negative Effects and A(s) is the con- 
dition under which the action has exactly these Effects. 
For technical reasons we include an action noop which 
leaves the state unchanged. 

J~su— {holds (at-robby(r i), s) A ^holds(at-robby(r 2 ), s) 
— > state{do{move{ri 1 r 2 ) 1 s)) o at-robby(r) 

= state(s) o at-robby(r 2 ) , 

holds (at(b , r) , s) A holds (at- r obby (r), s) 
A holds (free(g), s) A -^holds (carry <{b, g), s) 
— > state(do(pick(b, r, g), s)) o at(b, r) ofree(g) 
= state (s) o carry (b, g) , 

holds (carry (b, g), s) A holds (at-robby(r), s) 
A ->holds(at(b,r), s) A ->holds(free(g),s) 
— > state(do(drop(b, r, g), s)) o carry(b, g) 

= state (s) o at(b, r) o free(g) , 

state(do(noop, s)) — state(s) } 

The states, which may occur in a planning problem, 
are constrained the following axiom, which states, that 
fluents cannot occur twice in a state: 

Tms = {(Vs, z) ->(3g) state(s) = g o g o z] . 

The complete fluent calculus axiomatization J 7 of a 
planning problem consists of the axioms mentioned 
above, as well as some additional axioms, whose dis- 
cussion is out of the scope of this paper, namely a set 
Fmset of equational axioms containing (AC1), which 
define the semantics of o , as well as a set of unique 
name assumptions T un for the sort fluent . Please 
refer to the given literature for a detailed discussion. 

•J~ un U J~\ mset U J~Sq U 3~ ms U J~su • 

Reasoning problems themselves are specified as en- 
tailment problems in the fluent calculus. For the Grip- 
per class we obtain the entailment problem 

T |= (3s) holds (at(B 1 ,B), s) A ... A holds (at(B n , B),s). 

In general, reasoning about planning problems in the 
fluent calculus amounts to solving an entailment prob- 
lem of the form 

T |= (3z) [(3s) state(s) = z] A <S> G (z), 

where &g(z) is a goal formula with z as the only free 
variable. Such problems have a solution if we find a 
substitution a for z such that 

T |= [(3s) state(s) = za] (2) 



and 

T h $g(zct) . (3) 

It is sufficient to restrict our search to substitutions a 
which actually denote states of our reasoning problem, 
i.e., substitutions which contain solely bindings of vari- 
ables of sort state to constructor state terms. Such 
substitutions arc called constructor state substitutions. 
In the sequel, a will always denote a constructor state 
substitution. 

The main idea of our algorithm presented in 
(Holldobler & Storr 1999; 2000) is to calculate succes- 
sively a sequence {Z,- L \ i > 0) of sets of solutions to 
(2) which correspond to the sets of states reached after 
executing 0, 1, 2, ... actions starting in the initial 
state, until a state is found which is a goal state, i.e. 
the corresponding substitution fulfills (3), or, if no new 
states are reached, in which case there is no plan. The 
implementation of this algorithm is done by represent- 
ing these sets as well as the relation between them by 
means of binary decision diagrams (BDDs) . 

Binary Decision Diagrams 

The idea of BDDs is similar to decision trees: a Boolean 
function is represented as a rooted acyclic directed 
graph. The difference to decision trees is that there 
is a fixed order of the occurrences of variables in each 
branch of the diagram, and that isomorphic substruc- 
tures of the diagram are represented only once. 3 This 
can lead to exponential savings in space in comparison 
to representations like decision trees or disjunctive or 
conjunctive normal form. 

We will introduce BDDs via an example. A formal 
treatment of BDDs is out of the scope of this paper and 
we refer the interested reader to the literature (see e.g. 
(Bryant 1986; Clarke, Grunberg, & Long 1994; Burch 
et al. 1992)). 

Consider the propositional logic 
formula (a A b) V (c A d) . Using the 
variable ordering a < b < c < d a 
BDD representation of this formula 
is given in the figure to the right. For 
a given valuation of the propositional 
variables a, b, c and d the value of 
the Boolean function represented by 
the BDD is obtained by traversing 
the diagram starting from the root 
and taking at each node the edge la- 
beled with the value of the variable 
occurring in the node. 

The label of the terminal node defines the value of 
the function under the current valuation. For exam- 
ple (a \— > _L, b i — ► _L, c i — ► T, d i — ► _L) leads to a node la- 
beled _L , i.e., the value of the formula is _L wrt this 
valuation. 




Bryant has shown in (Bryant 1986) that, given a fixed 
variable order, every Boolean function is represented by 
exactly one BDD. Moreover, propositional satisfiability, 
validity and equivalence problems are decidable over 
BDDs in linear or constant time. Of course, the com- 
plexity of the mentioned problems does not go away: 
the effort has been moved to the construction of the 
BDDs. But as Bryant has shown as well, there are ef- 
ficient algorithms for logical operations, substitutions, 
restrictions etc. on BDDs, whose cost is in most cases 
proportional to the size of its operands. BDDs may be 
used as a theorem prover, i.e., by construction of a BDD 
corresponding to a logical formula, and check the BDD 
for interesting properties, but more often they are used 
as an implementation tool for algorithms which are se- 
mantically based on Boolean functions or, equivalcntly, 
propositional formulae, or, via the characteristic func- 
tions, sets. In the implementation these formulae or 
sets are always represented as BDDs. The use of BDDs 
in this paper follows this spirit. 

The Algorithm 

The algorithm for solving entailment problems in the 
fluent calculus follows in spirit the algorithm to find 
reachable states in finite state systems as presented e.g. 
in (Burch et al. 1994). As mentioned, the aim is to 
find the sets Zi of solutions for (2) representing states 
which can be reached from the initial state after the 
execution of i actions. The first crucial question to 
tackle is how to represent these sets using BDDs. 

Each solution to (2) is a constructor state substitu- 
tion {z/t} with a term t of the form o f x o . . . o f n , 
where the fi's are pairwise distinct. On first glance 
it seems impossible to represent substitutions by finite 
BDDs because there are infinitely many terms. Fortu- 
nately, however, if there are only finitely many fluents 
then there are also only finitely many terms t such 
that {z/t} satisfies (2) due to T ms . Furthermore, be- 
cause o is an ACl-symbol in the fluent calculus we do 
not have to distinguish between terms which are equiv- 
alent under the AC1 equational theory. In other words, 
a term t occurring in the codomain of a constructor 
state substitution is uniquely characterized by the set 
of fluents occurring in t . 

This observation opens a possibility for encoding sets 
of solutions for the entailment problem in the fluent cal- 
culus into a BDD: for each of the finitely many fluents 
/ which may occur in the binding for a variable z in 
a constructor state substitution we introduce a propo- 
sitional variable Zf . A constructor state substitution 
(j = {z/t} is represented by a valuation Bs(v) for 
these variables such that Zf is mapped to T by Bs{o~) 
iff / occurs in t . 4 Hence, a set S of constructor state 
substitutions is represented by a set of valuations. The 



3 Thus, the BDD is ordered and reduced, also called 
ROBDD. These properties are so useful that they are re- 
quired in almost all BDD applications, so many authors 
include these properties into the definition of BDDs. 



4 A substitution containing more than one binding is rep- 
resented similarly: for each variable in the domain of the 
substitution we introduce a separate set of propositional 
variables which encodes the binding of that variable. 



set of valuations itself is represented by a propositional 
formula Z such that the set of models for Z is the 
set of valuations. Finally, Z is represented by a BDD. 
For example, if the alphabet underlying the fluent cal- 
culus contains precisely the fluent symbols a, b and c , 
then a substitution a = {z/aoc} is represented by a 
valuation as follows: 

a = {z/ ao c } 

&s{o) = { z a i-> T, z b i-> JL, 2 C i > T } , 

and set {{z/aoc}, {z/cob}} is represented by the for- 
mula (z a A -iZfc A z c ) V (-iz a A Zb A Z c ) . 

Before we return to the application of BDDs, let us 
first consider the process of calculating the sequence 
[Zi | i > 0) . Zq can be immediately derived from 
<&i(z) . But how can we compute Z i+1 given Zi and 
T su ? In order to answer this question we define 

T Ha) (z, z') = [A(z) A z' o d- = z o 0+] . (4) 

for each state update axiom <&(a) G J-" su of the form 

A(state(s)) — » state(do(a, s) o $~ = state(s) o i9 + 

Furthermore, for the set T su we define 

T(z,z')= \/ T 0( a )(^^') • ( 5 ) 

This definition is motivated by the following result, 
whose proof can again be found in (Holldobler & Storr 
1999) 

Lemma 1. Let t and t' be two constructor state 
terms and T |= state(s) = t . Then, 

T \= state(do(a, s)) = t' iff 

Fun U Tmset \= T^( a ) (t, t' ) for some 0(a) G J" s „. 

Applying this lemma we are able to characterize our 
sequence (Zi | i > 0) without implicitly referring to 
T su by the use of state : 

Z n = {a\F\=Z n {za)},n>0. (6) 

where 

Z„(z) =*/(*) (7) 

Z J+1 (z) = (3z') (Z,(z') A T(z',z)) . (8) 

The crucial point of our application of methods and 
techniques based on BDDs to reasoning in the fluent 
calculus is the following: We could identify a class F 
of formulae over the alphabet underlying the fluent cal- 
culus and a transformation B mapping each F G F 
to a propositional logic formula B(F) such that (i) the 
class is expressive enough to represent interesting en- 
tailment problems wrt the fluent calculus (namely, it 
contains the formulae like Zj defined in (7) and (8)) 
and (ii) the following result holds: 

Lemma 2. Let F G F U {$i(z), &g(z)} and a a 
constructor state substitution such that Fa does not 
contain any free variables. Then, 

Tun U Tmset \= Fa iff B s {a) \= B(F). 



The axiom set T U n U T m set contains the basic equa- 
tional theory behind the fluent calculus fragment used 
in this paper, and describes the semantics of o . The 
precise definition of F and B as well as the proof 
of this lemma is beyond the scope of this paper and 
we refer the interested reader to (Holldobler & Storr 
1999) or (Holldobler & Storr 2000) for the details. 
The class F is subset of all fluent calculus formu- 
lae consisting of restricted forms of equations of type 
state = state without use of function symbol state , 
as well as boolean combinations of these and a restricted 
form of existential quantification over state variables. 

Applying the translation B to the sequence of for- 
mulae (Zi(z) | i > 0) we obtain a procedure for cal- 
culating the sequence (Zi \ i > 0) as follows. Let 
{/lj • ■ • :fn} be the finite set of fluents in the alpha- 
bet underlying the fluent calculus. Furthermore, let 
F[zi,... , z n ] denote a propositional logic formula F 
built over the propositional variables Z\, ■ ■ ■ , z n ■ The 
sequence (Zi \ i > 0) of propositional logic formulae 
corresponding to (Zi \ i > 0) is defined by 

Z [z\ = B($/(z)) (9) 
Z l+1 [z'} = (3z) Zi[z\ A B(T(z, z'))[z, z'], (10) 

where z is the vector Zf t , . . . , Zf n of propositional vari- 
ables used to encode z and (3z) F is an abbreviation 
for (3zi ) ...(3z n ) F with 

(3zi) F = F{ Zl /L} V F{ Zi /T} . 

The propositional formulae (Zi | i > 0) are exactly 
the translations of the fluent calculus formulae (Zi | 
i > 0) by means of B , and thus, because of lemma 2, 
representations of the sequence of sets (Zi i > 0) . 

From (9) and (10) the so called forward pass of our 
planning algorithm for computing the sequence (Zi | 
i > 0) can be derived: 

1. Define Zq , in form of the BDD-representation of 
Zq , such that it contains only the initial state of the 
reasoning problem. 

2. Recursively calculate Zi + i , in form of the 
BDD-representation of Z<+i , based on Zi and 
B(T(z, z 1 )) , until either Zi overlaps with the set G 
of goal states, in which case the reasoning problem is 
successfully solved or no new states are generated, in 
which case the reasoning problem is unsolvable. 




Zq Zi Z2 Zk 



Figure 1: The forward pass of our algorithm. After fe steps 
the sets Zk and G overlap. 



The algorithm is illustrated in Fig. 1. Starting from 
the initial state all reachable states are generated. The 
algorithm terminates as soon as this set of states over- 
laps with the set of goal states or can no longer be ex- 
panded. The inclusion of a special noop Action, which 
leaves the state unchanged, ensures that the sets con- 
secutively grow larger such that we know that all states 
have been visited iff Zi = Z i+1 . Alternatively to the 
inclusion of a noop Action, one can check for cycles 
in the sequence of sets and whether the sets become 
empty. 5 In either case the algorithm is guaranteed to 
terminate after at most 2™ — 1 steps, where n denotes 
the number of fluents, since the number of reachable 
states is at most 2™ and thus the length of the shortest 
plan can be at most 2" — 1 (such that every state is 
visited once). 

If the forward pass terminates successfully, then in 
a second step a shortest plan is constructed. This is 
done by choosing a state from G (~l Zk and searching 
for a chain of states through which this state can be 
reached from the initial state. This is done by iterating 
backwards through the sets Zi generated by the for- 
ward pass algorithm. Because this second step is a com- 
putationally (relatively) inexpensive part, we refer the 
interested reader to (Holldobler & Storr 1999), where 
also the soundness and completeness of the combined 
algorithm is established. 

Optimizations 

The planning approach described above is an implicit 6 
breadth first search. In each single step we search the 
whole breadth of the search tree in depth i . The sets 
Zi can get quite complex and their BDDs quite large. 
Even more so, the size of the BDD for B(T(z, z')) , 
which describes the relation between the Zi , can 
quickly become too large to be handled in a graceful 
manner. Our approach shares this problem with re- 
lated model checking algorithms. Thus, a number of 
techniques were invented to limit a potential explosion 
in its size. In the sequel some of these techniques and 
their effects are discussed. 

Variable Order 

It is well known that the variable order used in a BDD 
has a large influence on the size of the BDD. Unfortu- 
nately it is still a difficult problem to find even an near 
optimal variable order. 7 Often, a good and acceptable 
variable order is found by empiric knowledge and ex- 
perimentation. A general rule is to group variables viz. 

5 When applying frontier simplification, as discussed 
later, the set Zi becomes empty when all states have been 
considered. 

6 It is called implicit because the calculated sets of states 
are never explicitly enumerated, but represented as a whole 
by a BDD, whose size depends more on the structure of the 
set, than on its actual size. 

7 The problem to find the optimal variable order is NP- 
complete. 



fluents, which directly influence each other, together. 
In particular, the variables Zf and z'j occurring in 
B(T(z, z')) should be ordered next to each other order. 
But how should these variable groups be arranged? An 
ordering we call sort ordering led to good results in 
several reasoning problems (see Tab. 1). The idea un- 
derlying the sort ordering is to group fluents by their 
arguments. For example, in the Gripper class the flu- 
ents at(B 1 ,A), at(B 1 ,B), carry(B 1 , d), carry(B 1 , G 2 ) 
should be grouped together, because they share the 
argument B\ . Remember that the fluent calculus is 
sorted. The sort ordering works as follows. First one 
considers the argument of each fluent which belongs to 
the largest sort and sorts the fluents according to this 
argument. The remaining ambiguities are resolved by 
considering the argument of the second largest sort and 
so forth as well as the leading function symbol. For 
some domains Tab. I shows some almost dramatic im- 
provements in the size of the BDDs for sort ordering 
if compared to a simple lexical ordering. The latter re- 
sults in grouping fluents with the same leading function 
symbol together. For some domains, however, there is 
little or no improvement; this is usually the case when 
there are no large sets of objects as parameters for flu- 
ents. 



Problem 


Gripper 


Blocksworld 


GET- PAID 




(20 Balls) 


(8 Blocks) 




lexical 


217409 


206995 


25633 


sort ordered 


3087 


23373 


38367 



Table 1: The Size of the BDD for B(T(z,z')) with an or- 
dering of the variables by name (lexical) or with the sort 
ordering heuristic. The problems are from the planning 
problem repository (McDermott 1999). 



Partitioning of the Transition Relation 

The maximal size of a BDD is exponential in the num- 
ber of propositional variables it contains. Thus, the 
BDD representing B(T(z, z')) , which contains twice as 
many propositional variables as the BDDs representing 
the Zi , is prone to get very large. A way to reduce 
this problem is to divide the disjunction T(z, z') into 
several parts Ti, . . . , T„ , which correspond to subsets 
of the state update actions. 

Let F S u,i, >•••) Fsu,k be a partition of T su and 
define for all 1 < i < k 

Ti(z,z')= \/ T 0(a) (z,z') 

*(a)£J r «.,i 

fc 

such that T(z,z') = \J T,(z,z') . Thus, (10) is mod- 

i=l 



ified to 
Z i+ i[z'] = 

k 

\J(3z) (Zi[z\ A B(T k (z,z'))[z,z'}) . (11) 

i=l 

Fig. 2 illustrates the partitioning of the transition re- 




Figure 2: The partitioning of the transition relation. Each 
of the codomains of Ti , T 2 , T 3 and T 4 is significantly 
smaller than the codomain of T . 

lation. 

The positive effect of the partitioning is that the ac- 
tions in each subset effect only a subset of all fluents. 
Because the maximal size of a BDD is exponential in 
the number of propositional variables, the sum of the 
sizes of the BDDs corresponding to the partition may be 
significantly smaller than the size of the original BDD. 

In our implementation the number of partitions is 
adaptive: first the BDDs B^^^ (z, z')) for every sin- 
gle action are constructed, then they are combined un- 
til a parameter "partition treshold" is exceeded. In the 
experiments, partitioning led to a reduction of needed 
memory in most of the tested problems as shown in 
Fig. 4 at the end of the paper. 

On the other hand, a reduction in memory size does 
not necessarily lead to a reduction in calculation time 
as the results depicted in Fig. 3 indicate. According 
to equation (11) the various parts of the partitioned 
transition relation have to be put together, and this 
takes time. Nevertheless splitting can be useful even 
if the computation time increases, because of the re- 
duction of the needed memory to store the BDDs. For 
example in the case of MPRiME-x-1 the problem was 
not manageable under our memory constraints without 
partitioning the transition relation. 

The idea to partition BDDs can also be applied to 
the BDDs representing Zi . We have not yet explored 
this idea, because in our test problems these BDDs were 
only moderately large (i.e., up to 100.000 nodes). 

We have also implemented an optimization tech- 
nique called frontier simplification (Clarke, Grunberg, 
& Long 1994). This technique explores the fact, that 
the algorithm for solving the entailment problem in the 
fluent calculus works also if the following two conditions 
are enforced for all i > : 

• The set Zi contains all states which may be reached 



by executing i actions, but not by executing less 
than i actions. 

• the set Zi does not contain any states which cannot 
be reached by executing at most i actions. 

The sets Zi can be chosen freely within these limita- 
tions. Hence, it is desirable that the algorithm chooses 
the Zi such that their BDD representations are as 
small as possible. In our experiments frontier simpli- 
fication lead to moderate improvements (i.e. up to 40 
improved computation times, because the time saved 
for the computation of the recursion equation (10) was 
outweighted by the additional effort spent for the re- 
duction of the BDDs. 

Results on the Gripper Class 

The problems of the Gripper class were quite hard 
problems for the planners taking part in the AIPS98 
competition. Their difficulty is rooted in the combina- 
torial explosion of alternatives due to the existence of 
two grippers. In Fig. 5 the runtimes of these planners 8 
are compared to our system, BDDplan. 9 Only one 
planner (HSP) was able to solve all of the problems of 
this class, but it generated only suboptimal plans by 
using only one of the two grippers, whereas BDDplan 
generates the shortest possible plan by design. 

Discussion 

We have presented in this paper our preliminary find- 
ings in applying BDD techniques as an implementation 
tool for reasoning about situations, actions and causal- 
ity in the fluent calculus, and discussed several tech- 
niques that have been successfully used to improve the 
performance of the implementation. 

We tested our implementation using the problems 
of the planning contest on AIPS98 and have received 
mixed results so far. As discussed in section , our plan- 
ner performed very good in the Gripper class: It was 
able to provide the shortest solutions to even the most 
difficult problems posed in this class, whereas the plan- 
ners which have participated in the competition were 
only able to solve but the simplest problems or, in the 
exceptional case of HSP, provided sub-optimal solu- 
tions ignoring the second gripper of the robot. In some 
other problem classes, however, our implementation did 
not outperform existing systems. On the other hand, 
we have just started to investigate optimization tech- 
niques and will continue to do so in the future. 

At present, our algorithm (described in more detail 
in (Holldobler & Storr 2000; 1999)) is closely related to 
model checking algorithms (Burch et al. 1992) which 
perform symbolic breadth first search in the statespace. 

8 See http://ftp.cs.yale.edu/pub/mcdermott/aipscomp- 
results.html. 

9 The runtime of BDDplan is measured on a different 
machine, so the comparison is only accurate up to a constant 
factor. 



It generates a series (Zi \ i > 0) of propositional for- 
mulae represented as BDDs, which encode the set of 
answer substitutions a — {z/t} for the fluent calculus 
formulae T \= (3(aj)i<j<„) zo = state(a n . . . aiSo) , 
which represents sets of states t reachable after the 
execution of a sequence of actions (ai)i<i<„ of length 
n , until there is a goal state among the stales encoded. 
The formulae Zi are generated recursively by apply- 
ing the propositional encoding of a transition relation 
T(z,z'). 

The optimization techniques presented in this paper 
do not change the principle of breadth first search the 
algorithm is based on. This has the pleasant effect that 

• the algorithm is complete in the sense that it always 
either finds the shortest plan or is able to prove that 
there is no plan, and 

• it is possible to reuse the results of the computation- 
ally intensive forward pass stage, in which the se- 
quence of sets of reachable states (Zi \ i > 0) is 
constructed, to either create many solutions to the 
same reasoning problem or to solve multiple reason- 
ing problems with the same initial state. 

On the other hand, in order to speed up the search 
it seems one should give up the concept of breadth first 
search and explore interesting parts of the search space 
first. This can be done without giving up completeness 
by stepwise adding actions to the transition relation, 
which seem heuristically relevant for reaching the goal, 
and explore the subtrees of the search space generated 
by these actions first. This concept is similar to ab- 
straction in planning (Knoblock 1994) and is topic of 
future research. 

It should be noted that although we have presented 
our algorithm in such a way that there is only a sin- 
gle initial state (i.e., the set Z$ is unitary), the algo- 
rithm itself is not restricted to this case. If the initial 
situation is only incompletely specified then there are 
several initial states, which leads to a set Z contain- 
ing more than one element. However, a straightforward 
application of the algorithm to such a non-unitary set 
Z would result in a "brave" reasoning process in the 
sense that the plan generated works for at least one of 
the initial states, but is not guaranteed to work for the 
others. 

There is a number of approaches in planning that are 
based on propositional logic (Weld 1999). Many of the 
most successful are rooted in the planning as satisfia- 
bility (Kautz & Selman 1996) and Graphplan (Blum & 
Furst 1997) or both. Our algorithm is similar to Graph- 
plan in that it builds up a data structure for each level, 
which describes the states reachable after the execution 
of n actions, (though Graphplan admits the parallel ex- 
ecution of multiple actions in a time step if they do not 
interfere.) Unlike Graphplan, that gives only an upper 
bound of the the set of states reachable by its mutex 
mechanism, our algorithm computes an exact symbolic 
representation of this set. Consequently, the plan ex- 
traction process is deterministic and no backtracking is 



needed. 

In contrast to algorithms based on planning as satisfi- 
ability (SATPLAN) and Graphplan the algorithm pre- 
sented here is not limited to the generation of polyno- 
mial length plans and is complete. On the other hand, 
each time step may take space exponential space, since 
the maximum size of BDDs is 0(2 n ) for n propo- 
sitional variables. However, the experimental results 
achieved so far indicate that in practice the BDDs are 
much smaller than the theoretical limit. 

Still, the size of the encountered BDDs is the main 
problem limiting the scalability of the algorithm and is 
an topic of further research. Since the maximum size 
of BDDs is exponential in the number of propositional 
variables, the reduction of this number is a foremost 
concern. By design our algorithm avoids the unfold- 
ing of all time steps of the plan into disjunct sets of 
propositional variables, as in the case of SATPLAN and 
Graphplan, since all time steps are treated separately. 
Moreover we are able to omit the variables encoding 
actions easily, since we are not restricted to a clausal 
form of the formulas we are working with, and the ac- 
tions can be reconstructed from the sequence of states. 
The encoding we use at present is "naive" in the sense 
that each fluent corresponds to a single propositional 
variable. We assume that the use of domain dependent 
properties of fluents provides a large space for improve- 
ments, as discussed in (Edclkamp & Hclmert 1999) for 
the BDD based planning system Mips, which is used to 
explore automated generation of efficient state encod- 
ings for STRIPS/ ADL/PDDL planning problems and 
the implementation of heuristic search algorithms with 
BDDs. 

Depending on the task, it seems to be inevitable to 
encode the actions in the case of non-deterministic do- 
mains, as in the work of (Cimatti, Roveri, & Traverso 
1998). Their system generates so-called universal 
plans, which consist of a state-action table that con- 
tains for each state the action, which leads to the goal 
in the shortest way This approach opens new possi- 
bilities in generation of plans for non-deterministic do- 
mains. However, considering the case of deterministic 
domains, we conjecture, that this approach is limited 
to less complex reasoning problems in comparison to 
state-only encodings, because, additional to the states 
before and after the execution, the executed actions 
have to be encoded into the transition relation as well. 
This leads to a considerable increase in the number of 
propositional variables and, consequently, in the maxi- 
mal size of the BDDs. But we have not yet performed 
direct comparisons to bolster this conjecture. 

The translation used in our approach to map flu- 
ent calculus entailment problems to propositional logic 
is tailored to a specific class of fluent calculus formu- 
lae, which is just large enough to specify the consid- 
ered class of planning problems. However, it seems 
likely that there is a more general way to translate 
the formulas of a larger fragment of the fluent calcu- 
lus while keeping the restriction to propositional flu- 



ents, such that we could introduce recent work on 
the fluent calculus like ramification (Thiclscher 1998a; 
1998b) into our planner without modifying the transla- 
tion and the proofs. The concept of ramification within 
the fluent calculus involves a limited use of constructs 
of second order logic, namely a calculation of the tran- 
sitive closure of a relation over states, but this does not 
seem to pose a difficult problem as the set of states is 
finite and there are algorithms to compute this tran- 
sitive closure using BDDs (Clarke, Grunbcrg, & Long 
1994). 

To sum up, our BDD based implementation shows 
some promising initial results but it is too early to com- 
pletely evaluate it yet. 
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Figure 3: Effects of the parameter "partitioning threshold" on the calculation time for several problems. The time 
is relative to the time taken when no partitioning is done. 
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Figure 4: The sum of the sizes of the BDDs used to represent the transition relation in dependence on the parameter 
"partitioning threshold" . 
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Figure 5: Runtimes of different planners on the Gripper problem (in milliseconds) with different numbers of balls. 
Planners marked with opt provided optimal (i.e. shortest) plans, planners marked with -adl work on the sorted 
version of the domains, the others on the STPJPS-version. 



